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Abstract 

This technical report describes a simple and efficient method for expanding q -bit integer values 
to m -bit integer values ( m > q ) by means of bit replication. First, we show that the optimal 
number of repetitions is given by ceilingjm/ q} and that the method is equivalent to 
multiplication by the ideal gain when ml q is an integer. We then demonstrate that, in the case 
where ml q is not an integer, truncating the fraction bits to the right of the decimal point will lead 
to zero average error. The report also includes two suggestions for implementing the bit- 
replication process, both of which have a vast complexity advantage over a multiplier. Two 
examples are given at the end to illustrate the bit-replication process in action. 
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Introduction 

There are several applications where the set of values within a particular range needs to be 
operated on at a larger, or higher-precision, range. In particular, we will consider the case where 
a q -bit integer input level L, with values from the range 

L, £{ 0,1,2,. ...2«-l} 

is to be scaled to a m -bit integer output level L a with values from the range 

L oG {0,...,2 m -l}. 

This can be achieved by performing a multiplication by an ideal gain, 

2 m -l 

G = 

2 q -l 

followed by rounding to the nearest integer: 

L a = Round{Gx L ; }. 

In this paper we will show that this ideal method can be closely approximated with a much 
simpler implementation of bit replication. We will also determine the optimal number of bits to 
replicate for minimum error. 

Series Expansion 

For any real numbers a and b , where b is not equal to 0 or 1, the follow expression is valid: 

a — \ _a (a/b) — l 
b-l~ b b-l 

This expression can itself be used to expand the rightmost term further: 

(alb)-\ _ a (alb 2 )-\ 
b-l ~ b 2 b-l 

Recursively repeating this expansion N times results in the series: 

a - 1 \-i a | {alb )-l 



1 _ V 1 a 



b-l 



In our case, using a = 2 m and b = 2 q , the ideal gain can be expressed as 



m—iq 



i=l 

From this, we define the approximate gain to be 



2 9 -l 



5 



Q = Y^ 2 m - iq , 



1=1 



and the gain-error to be 



E = 



1-2 



m—Nq 



2 q -l 



For the purpose of analysis, let us assume that the output level L o can be a real number (instead of 
an integer) and the rounding operation is unnecessary. The data scaling will therefore be 
implemented as follows: 

L 0 =GxL ; . 

Under these assumptions, the gain-error E is proportional to the error in the output level. 

Bit Replication 

We can show that multiplication by the approximate gain G can be implemented simply as a 
replication of the input level bits. Since each term in G is a power of 2, multiplication is 
equivalent to a shift of the input. Note also that the shifts corresponding to successive terms 
differ by q spaces, which is the same as the number of bits in the input. This means that there is 
no overlap of bits and no addition is necessary. Consider the example of converting a 3-bit input 
( q = 3 ) to an 8-bit output ( m = 8 ). In Figure 1 the values of the 3 input bits are "A", "B", and 
"C" as shown. If the sum was carried out to 5 terms or repeat periods (N = 5), the output value 
would have the bit values as indicated. This bit replication extends into the fraction bits to the 
right of the decimal point. 
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Figure 1. Example of scaling a 3 -bit input value to 8 bits 
using the approximate gain G with N = 5. 
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Figure 2. Representation of a real number with binary digits. 

The weighting of the bit values in this representation is shown in Figure 2. Multiplying an 
input by the approximate gain will result in a real number with an integer part and a fraction part. 
Although the goal is to generate a m -bit integer output value, it may be useful to keep additional 
fraction bits for intermediate computations of the output value for higher accuracy. 

In the next sections two aspects of the bit-replication process will be optimized in terms of 
minimizing the error. First, the optimal value of N is determined. Then for cases where the last 
repeat periods have both integer and fraction bits (as in the third period of Figure 1) the 
contribution of the fraction bits to the accuracy of the output level is found. 

Optimization of Repeat Periods 

The optimal number of repeat periods N can be determined simply by solving for the value 
of N that minimizes \E\ . \E\ is zero exactly once when m- Nq = 0 , so the optimal value should 
be 

N = ml q . 

The problem is that N can only take on integer values so this result will only occur when q is 
an exact multiple of m . However, if this condition does hold, the error is identically zero, and bit 
replication yields a perfectly scaled output level. 

For the case where (ml q) is not an integer, the solution for the optimum N is more 
involved. In Figure 3 \E\ is plotted as a function of N . The graph reveals two candidate integer 
values for N on either side of the point (mlq): N L0 is the nearest integer less than (ml q) and 
N HI is the nearest integer greater than (mlq). Since the denominator of \E\ is constant with 
respect to N , we can consider the equivalent problem of minimizing the function 

D _ I ^m-Nq _ i I 
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Figure 3. Absolute error as a function of N . 

For N <ml q , (3 increases monotonically with decreasing N because the exponent 
(m-Nq) becomes increasingly positive. This exponent is a positive integer, so the smallest [3 
can be is 1. 

For N >ml q , (3 increases monotonically with increasing N because the exponent 
{m-Nq) becomes increasingly negative. ( asymptotically approaches 1/ (2 q — 1) . ) The 
exponent (m— Nq) is a negative integer, so (3 < 1 . 

Therefore, of the two candidates, N L0 and N HI , the value of N that minimizes \E\ is N HI . 
This is the nearest integer larger than the non-integer value (ml q) , or equivalently, 

N = ceilingjm/g} . 

Contribution of Fraction Bits 

Consider the case of q = 5 and m = 8 . The optimal number of repeat periods is 

Af = ceiling{8/5} = 2. 

For a 5-bit input, the output is as shown in Figure 4. There are 2 fraction bits for this particular 
example. In all cases, the number of fraction bits due to repeating bits by the optimum N periods 
is 
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Figure 4. Scaling a 5-bit number to an 8-bit number with N = 2 repeat periods. 

b = Nq-m. 

Note that there are no fraction bits for the error-free case where {ml q) is an integer. 

In the following, we derive the average contribution of the fraction bits over all possible input 
levels. Since the fraction bits are simply the b least significant bits of each input level L, , they 
will take on the values {0,1,2,..., 2 h -1} in a periodic fashion as L, increases. Note that there will 
be 2 q ~ b complete periods in the range of L ( . The average contribution of the fraction bits over all 
L, is therefore the same as the average contribution over one single period. 

Let the value of the fraction bits be i and the contribution of the fraction bits to the output be 
F . We can then write 

F = ±- 



and the average contribution of the fraction bits to the output is therefore given by 

1 



Using the fact that 



the above expression simplifies to 



F = 



•>b ^( r,b 



L 1=0 z 



1=0 



M — \ 



F = 



1-2" 



Next, we derive the average output error due to the gain-error over all possible input levels. 
Using the change of variables b = Nq-m , we can rewrite the gain-error to be 

\-2~ b 



E = 



2 q -l 
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Figure 5. Output error as a function of input, 
and the contribution of fraction bits to the output value. 

For each input level L, , the output error e is E x L t ■ . The average output error is then given by 



i 2"-l 



e 

2 



L,=0 



7 q -1 
V z 1 J 



1-2 



For our example of q = 5 and m = 8 , the contribution of the fraction bits F and output error e 
are plotted in Figure 5. 

It is very interesting to note that for all cases, 

e = F . 

By this we can conclude that zero average error can be achieved by always truncating the fraction 
bits! Using our example of q = 5 and m = 8 .again, the output error is plotted in Figure 6 for the 
cases of no fraction bits used, one fraction bit used, and both fraction bits used. The average 
errors are 0.000, 0.250, and 0.375 ( = (1-2" 2 ) /2 ) respectively. 

Implementation 

We have proved that multiplication of a q -bit input level L, by the ideal gain 
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Figure 6.0utput error as a function of number of fraction bits retained. 




2 ? -l 



can be achieved with zero average error by simply replicating the input bits from the most 
significant part of the m -bit output register to the least significant part. Using additional fraction 
bits for intermediate computation can only increase average error. 

The most straightforward hardware implementation of this bit replication method is the wired 
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Figure 7. Wired implementation. 
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Figure 8. Shift-register implementation 

approach as illustrated in Figure 7. In this figure we use the case of q = 3 and m = 8 as an 
example. "A", "B", and "C" represent input bit values. 

Another implementation can use shift registers as depicted in Figure 8. The input register 
performs a circular shift left into an output shift register as shown. The operation is complete 
after m shifts. 

Either case has a tremendous complexity advantage over a multiplier. 

Examples 

We will end this document by using two examples to illustrate the bit-replication process in 
action. The first example is the case of q = 5 and m — 8 . In Table 1 , we show, for each possible 
input level, the actual output level and its difference from the ideal output level. Note that the 
average error over all possible input levels is zero. Table 2 shows the case of q = 4 and m = 12 . 
In this case, ml q is an integer, so the bit-replication gives the exact answer and the error is zero 
for all input levels. 
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Table 1 : The Input Levels, the Output Levels, and the Errors 
for the Case of q = 5 and m = 8 
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Table 2: The Input Levels, the Output Levels, and the Errors 
for the Case of q = 4 and m = 12 
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